**********************************************
This folder contains the following code files:
**********************************************

1) script_Sampling_of_24000_tweets.R

This short script reproduces the numerical results reported in the paragraph starting with "As a first step..." in the section called "Design of content analysis". Specifically, the script samples 24,000 tweets.

Version of R: 4.0.2

2) script_All_remaining_analyses.do

This dofile reproduces all other numerical results, including figures and tables.

Version of Stata: Stata/IC 16.1


*********************************************
This folder contains the following log files:
*********************************************

1) log_Stata.log

Log for "script_All_remaining_analyses.do"


********************************************
This folder contains the following datasets:
********************************************

1) data_All_tweets_from_which_24000_are_sampled.xlsx

Dataset containing tweets posted by members of Congress between September 5 2017 and July 26 2018.

2) data_IDs_of_randomly_sampled_tweets.xlsx

Dataset containing the identification numbers of the 24,000 tweets that were randomly sampled.

3) data_Content_analysis_data_from_first_round.dta

Dataset containing data from the first round of crowdsourced content analysis. Each line corresponds to a rating by a coder.

4) data_Content_analysis_data_from_second_round.dta

Dataset containing data from the second round of crowdsourced content analysis. Each line corresponds to a rating by a coder.

5) data_Experimental_data.dta

Dataset with data from survey experiments.


********************************************
This folder contains the following codebook:
********************************************

1) codebook.xlsx

Codebooks for all five datasets.